SPD—a web-based secreted protein database

نویسندگان

  • Yunjia Chen
  • Yong Zhang
  • Yanbin Yin
  • Ge Gao
  • Songgang Li
  • Ying Jiang
  • Xiaocheng Gu
  • Jingchu Luo
چکیده

With the improved secreted protein prediction approach and comprehensive data sources, including Swiss-Prot, TrEMBL, RefSeq, Ensembl and CBI-Gene, we have constructed secretomes of human, mouse and rat, with a total of 18 152 secreted proteins. All the entries are ranked according to the prediction confidence. They were further annotated via a proteome annotation pipeline that we developed. We also set up a secreted protein classification pipeline and classified our predicted secreted proteins into different functional categories. To make the dataset more convincing and comprehensive, nine reference datasets are also integrated, such as the secreted proteins from the Gene Ontology Annotation (GOA) system at the European Bioinformatics Institute, and the vertebrate secreted proteins from Swiss-Prot. All these entries were grouped via a TribeMCL based clustering pipeline. We have constructed a web-based secreted protein database, which has been publicly available at http://spd.cbi.pku.edu.cn. Users can browse the database via a GO assignment or chromosomal-location-based interface. Moreover, text query and sequence similarity search are also provided, and the sequence and annotation data can be downloaded freely from the SPD website.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

FunSecKB: the Fungal Secretome KnowledgeBase

The Fungal Secretome KnowledgeBase (FunSecKB) provides a resource of secreted fungal proteins, i.e. secretomes, identified from all available fungal protein data in the NCBI RefSeq database. The secreted proteins were identified using a well evaluated computational protocol which includes SignalP, WolfPsort and Phobius for signal peptide or subcellular location prediction, TMHMM for identifying...

متن کامل

Visualizing Multiple System Atrophy Studies Based on Collaboration Network and Centrality Indices in Web of Science Database

Introduction: Social network analysis is an analytical method based on graph theories that identifies relationships between individuals or factors to analyze the social structures resulted from those relationships. The objective of this study was to analyze co-authorship and co-word networks based on scientometric indicators and centrality measures in the studies on multiple atrophy system dise...

متن کامل

Effective—a database of predicted secreted bacterial proteins

Protein secretion is a key virulence mechanism of pathogenic and symbiotic bacteria, which makes the investigation of secreted proteins ('effectors') crucial for understanding the molecular bacterium-host interactions. Effective (http://effectors.org) is a database of predicted bacterial secreted proteins, implementing two complementary prediction strategies for protein secretion: the identific...

متن کامل

Cloning rhoptry protein 1 (ROP1) gene of Toxoplasma gondii (RH) in expression vector

  Toxoplasma gondii contain various immunogenic antigens. The most important Toxoplasma antigens are somatic and excreted/secreted antigens. Rhoptry proteins are known as excreted/secreted antigens. These antigens have been proposed as a vaccine candidate against toxoplasmosis. The main objective of the present work was cloning rhoptry protein1 (ROP1) Gene of Toxoplasma gondii (RH) in a cloning...

متن کامل

Soybean Proteome Database 2012: update on the comprehensive data repository for soybean proteomics

The Soybean Proteome Database (SPD) was created to provide a data repository for functional analyses of soybean responses to flooding stress, thought to be a major constraint for establishment and production of this plant. Since the last publication of the SPD, we thoroughly enhanced the contents of database, particularly protein samples and their annotations from several organelles. The curren...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Nucleic Acids Research

دوره 33  شماره 

صفحات  -

تاریخ انتشار 2005